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Abstract 

We study bispecial factors in fixed points of morphisms. In particular, 
we propose a simple method of how to find all bispecial words of non-pushy 
circular DOL-systems. This method can be formulated as an algorithm. 
Moreover, we prove that non-pushy circular DOL-systems are exactly those 
with finite critical exponent. 

keywords: bispecial factors, circular DOL systems, non-pushy DOL systems, critical 
exponent 

1 Introduction 

Bispecial factors proved to be a powerful tool for better understanding of complexity 
of aperiodic sequences of symbols from a finite set. One of the most studied families 
of such sequences are fixed points of morphisms. In this paper we present a method 
of how to describe the structure of all bispecial factors in a given fixed point. 

The method we describe here can be partially spotted in results of several authors: 
It is a sort of inverse of the algorithm by Cassaigne from paper [I] which is concerned 
by pattern avoidability. Very similar approach was used in [2] by Avgustinovich and 
Frid and in [3] by Frid to describe bispecial factors of biprefix circular morphisms and 
marked uniform morphisms, respectively. Actually, the fact that all bispecial factors 
in a fixed point can be generated as members of some easily constructed sequences 
was noticed in many papers where factor complexity was computed, see, e.g., [HE]. 
In this paper we formalize this approach and prove that it works for a very wide class 
of morphisms, namely non-pushy and circular morphisms. Moreover, it seems that 
the assumptions we need for proofs can be weakened or even omitted and the main 
theorems remain true. 

The paper is organized as follows. In the next section we introduce necessary 
notation and notions and also explain the importance of bispacial factors. Since it is 
easier to explain the main result using examples than to formulate it as a theorem, we 
do so in Section [3] Section [4] contains proofs of the crucial theorems and in Section [5] 
we explain how to use our results to identify immediately all infinite special branches. 
In Section [6] we prove that non-pushy and circular morphisms are exactly those whose 
fixed points have finite critical exponent. 

2 Preliminaries 

Let A — {0, 1, . . . , n — 1}, n > 2, be a finite alphabet of n letters; if needed, we denote 
this particular n- letter alphabet as A n . An infinite word over the alphabet A is a 
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sequence u = (wi)»>i where Ui G A for all i > 1. If v = UjUj+i ■ ■ ■ Uj+ n -i, j, n > 1, 
then « is said to be a factor of u of length n, the empty word e is the factor of length 
0. The set of all finite words over A is the free monoid A* , the set of nonempty finite 
words is denoted by A + = A* \ {e}. 

A map p : A* — s> .4* is called a morphism if tp{wv) — p(w)p(v) for every w, v G .4*. 
Any morphism <p is uniquely given by the set of images of letters tp(a),a G A. If all 
these images are nonempty words, the morphism is called non-erasing. A famous 
example of a morphism is the Thue-Morse morphism ptm defined by 

^tm(O) = 01, 
^tm(I) = 10. 

This paper studies infinite fixed points of morphisms: an infinite word w is a fixed 
point of a morphism ip if p>(w) = w. If <p e (w) = w for some positive i, w is a periodic 
point of (p. The fixed point of <ptm beginning in the letter is the infinite word 

u TM = Um ¥>? M (0) = Vtm(0) = 0110100110 • • • , (1) 

n — >oo 

which is called the Thue-Morse word. 

An infinite word u is aperiodic if it is not eventually periodic, i.e., there are no 
finite words v and to such that u = vwwww ■ • ■ = wui". If a word u — vw, then d is a 
prefix of it and w is its suffix. In this case we put (v)~ 1 u = w and u(w)~ 1 = u. Given 
a morphism <p on A, if <£>(a) is not a suffix of p(b) for any distinct a, b 6 A, then is 
said to be suffix-free. Prefix-free morphisms are defined analogously. 

The language of a fixed point u is the set of all its factors and is denoted by £(u). 
When speaking about a morphism, we usually mean a morphism together with its 
particular infinite fixed point. But a morphism can have more than one fixed point 
and not all of them must have the same language (this is true if the morphism is 
primitive). For instance, consider the morphism h-> 010, 1 i — >• 1 1 : it has two fixed 
points, one aperiodic starting in and one periodic starting in 1. Therefore, instead 
of speaking only about a morphism we will always speak about a morphism and its 
particular infinite fixed point. A well-established way of how to do so is to treat a 
morphism and its fixed point as a DOL-system (see, e.g., [6] and [?])■ 

Definition 1. A triplet G = (A, ip, w) is called a DOL-system, where A is an alphabet, 
Lp a morphism on A, and w £ A + is an axiom. The language of G denoted by C(G) 
is the set of all factors of the words ip n (w), n — 0, 1, . . . 

// ip is non-erasing, then the system is called PDOL-system. 

In what follows, when referring to a DOL-system, we always mean a PDOL-system. 
In fact, for any DOL-system, it is possible to construct its elementary (not simplifiable) 
version which is a PDOL-system with an injective morphism [8] [9]. 

Clearly, if tp(a) = av for some a 6 A, v G A + , and if <p is non-erasing, then the 
language of the DOL-system (.4, p, a) is the language of the infinite fixed point ip"(a). 

There are several tools which help us to study the structure of the language of 
DOL-systems. We mention here two basic ones: the factor complexity and critical 
exponent. The factor complexity of a language is the function C(n) which counts the 
number of factors of length n. The factor complexity is usually obtained using the 
notion of special factors. 

Definition 2. Let w be a factor of the language C(G) of a DOL-system G over A. 
The set of left extensions of w is defined as 

Lext(w) = {a G A : aw G C(G)}. 
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If #Lext(w) > 2, then w is said to be a left special (LS) factor of C(G). 

In the analogous way we define the set of right extensions Rext(w) and a right 
special (RS) factor. If w is both left and right special, then it is called bispecial (BS). 

The connection between special factors and the factor complexity is described 
in [5]; the complete knowledge of LS, RS, or BS factors enables to find the factor 
complexity. 

The critical exponent is related to the repetitions in the language. Let w be a 
finite and nonempty word. Any finite prefix v of w u = www ■ • ■ is a power of w. We 
denote this by v = w r , in words v is r -power of w, where r = -M- Further, we define 
the index of w in a language C(G) of a DOL-system G as 

ind(w, G) = sup {r e Q : w r e £(£?)} . 

And finally, the critical exponent of the language C(G) is the number 

sup{ind(w) | w £ C(G)}. 

More details about critical exponent can be found, e.g., in [TO]. Examples of how 
knowledge of BS factors can help to compute it for a fixed point of a morphism are 
in p] and [12]. 

3 Explaining the main result 

Since the main result of this paper is a tool rather than a theorem, we demonstrate it 
using example morphisms. The tool has two ingredients. First one is a mapping that 
maps a BS factor to another one and so, applied repetitively, it generates sequences 
of BS factors. This mapping is defined by two directed labeled graphs. The other 
ingredient is a finite set of BS factors such that the sequences generated from them by 
the mapping cover all BS factors in a given fixed point. 

Let us consider the morphism <£e defined by h-> 012, 1 t-t 112, 2 h-> 102 and the 
corresponding DOL-system (A3,<fiE,0) with the fixed point ue. The factor 2112 is LS 
and has left extensions and 1. If we apply the morphism ipE on this structure - both 
on the factor and its two extensions -, the resulting factor <^e(2212) is no more LS 
since the respective extensions <£e(0) = 012 and (^e(I) = 112 end in the same letter 
2. In order to obtain a LS factor, we have to cut off the longest common suffix of the 
new extensions, here it is 12, and append it to the beginning of the y>E-image of the 
factor: the result is the LS factor 12^e(2212) with left extensions and 1. We can 
proceed in the same manner and obtain another LS factor 12i^e(12)</?e(2112) again 
with the same extensions and 1. Clearly, the same process works for RS factors and 
right extensions. 

Let us formalize what we did in the previous paragraph. Instead of BS factors we 
will use a slightly different notion of BS triplets ((a, b), v, (c, d)), where v is a BS factor 
and (a, b) and (c, d) are unordered pairs of its left and right extensions, respectively. We 
assume that either avc and bvd or avd and bvc are factors. Thus, ((0, 1), 2112, (0, 1)) is 
a BS triplet in ue- In terms of the previous paragraph, we can get another BS triplet 
from this one, namely ((0, 1), 12<^e(2112), (0, 1)); we call this BS triplet the f -image 
of ((0, 1), 2112, (0, 1)). The fact that left extensions (0,1) result again in extensions 
(0, 1) with appending of 12 can be represented as a directed edge from vertex (0, 1) 
to vertex (0, 1) with label 12. The edge corresponding to the right extensions starts 
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in (0, 1), ends again in (0, 1) and is labeled by the empty word. Applying this idea 
on all possible pairs of left and right extensions gives us two directed labeled graphs 
depicted in Figure [T] We call these graphs graph of left and right prolongations. With 
these graphs in hand, it is easy to generate infinitely many BS triplets from a given 
starting one. 




Figure 1: The graphs defining the /-image for the morphism </je- 

The other ingredience of our method bears on the fact that all BS triplets can 
be generated by taking repetitively /-image of only finitely many initial BS triplets. 
Initial BS triplets are those which are not /-images of another BS triplet. For instance, 
((0, 1), 2112, (0, 1)) is not initial as it is the /-image of the BS triplet ((0, 1), 1, (0, 2)) 
which is initial. Later we will show how to find all the initial factors for a given fixed 
point. For the case of ue, we have eight initial BS triplets: 

((0,1), 121, (0,1)), ((0,1), 12, (0,1)), ((0,1), 21, (0,1)), ((0,1), 2, (0,1)), 
((1,2), 1,(1,2)), ((0,2), 1,(1, 2)), ((0,2), 1,(0, 2)), ((1, 2), 0, (1, 2)). 

The vertices of the graphs from Figure [l] are just all pairs of distinct letters, but 
the situation is not that simple for all morphisms. In fact, it happens for graph of left 
(right) prolongations only if the respective morphism is suffix- free (prefix- free) . This 
case when the morphism is both prefix- and suffix-free has been already solved in |2|, 
where not only describe the authors all BS factors, but they also give a formula for 
the factor complexity. 

Let us consider the morphism <ps defined by H y 0012, 1 i— > 2, 2 i— > 012 and the 
corresponding DOL-system (As,ip8,0). Clearly, the morphism is not suffix- free. Let v 
be a LS factor with left extensions (1, 2). If we apply the morphism as above, we have 
a problem: the longest common suffix of factors (/>s(l) = 2 and </?s(2) = 012 is 2 and so 
we do not know what are the left extensions of the LS factor 2ips(v). A solution is to 
consider left extensions longer than one letter, in such a case we say left prolongation 
instead of left extension. Clearly, the factor 1 is always preceded by 0. Hence, let us 
consider left extensions (01,2). Now, <^s(01) = 00122 is no more a suffix of tp2 = 012 
and so we know the new left extensions: again (01, 2). In this way we can construct a 
complete graph defining the respective /-image, the result is in Figure [3] (the notation 
will be explained later). 

To prove that a proper finite set of pairs of left and right extensions of arbitrary 
length always exists is not trivial. It is not simple even to describe the properties such 
sets should posses so that they define a correct /-image. We will call such sets left 
and right forky sets, see Definition |20| It may happen that a finite forky set does not 
exist and so our method fails. Therefore we will have to put some restriction on the 
DOL-systems considered: we will assume that the systems are circular and non-pushy. 
These notions are explained in the following section. 
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Finally, we can now state the main result of this paper: Given a circular non- 
pushy DOL-system with an aperiodic fixed point, there exist finite left and right forky 
sets defining two directed graphs and a finite set of initial BS triplets such that the 
corresponding /-image applied repetitively on the initial BS-triplets generates all BS 
factors. 

4 Forky sets and initial factors 
4.1 Circular and non-pushy DOL-systems 

Any factor of a fixed point of a morphism (p can be decomposed into (possibly incom- 
plete) ^-images of letters. For example, in the case of </?e we have: 012f0 is a factor 
of </?e(0)<^e(2), i.e., 0f210 is composed by (^E-images of and 2. We denote this using 
bars, i.e., 012] 10. The decomposition may not be unique. For instance 210 is always 
decomposed as 2| 10 but we do not know whether 2 is a suffix of v?e(0) or <£e(1) (it 
cannot be a suffix of <^e(2) since 22 is not a factor of ue). In the case of the factor 1, 
we do not even know where to place the bar if not at all. 

A factor can have more than one decomposition; however, if there is a common 
bar for all these decompositions, this bar is called a synchronizing point. Coming back 
to our example, 210 has a synchronizing point between 2 and 10, formally we say that 
(2, 10) is a synchronizing point of 210. 

Definition 3 (Cassaigne [TJ). Let tp be a morphism with a fixed point u, tp mjective 
on C(u), and let w be a factor of u. An ordered pair of factors (tui, W2) is called a 
synchronizing point of w if w — W\W2 and 

Vui,U2 G A* , (V1WV2 G ip(C(u)) => viwi g ip(C(u)) and V2W2 € ip(C(u))). 

We denote this by w = wi\ s W2- 

Definition 4. A DOL-system G = (.4, tp, w) is circular if tp is infective on C(G) and if 
there exists D £ N such that any v 6 C(G) with \v\ > D has at least one synchronizing 
point. This D is called a synchronizing delay. 

Some examples of both circular and non-circular DOL-systems follow. 

Example 5. The system G — (A2, <Ptm,0) is circular with a synchronizing delay 
4- It is clear that any w G £(G) containing 00 or 11 has the synchronizing point 
w = ■ ■ ■ 0| s ■ ■ • or w = ■ • ■ X I s 1 • ■ • . To see that, let us consider a word w of length 4 
not containing these two factors. Without loss of generality, assume that w begins in 
1, then w — 1010. This word can be decomposed into </?tm(0) and </Stm(1) in exactly 
two ways: |10|10| and 1 1 01 1 . But the latter one is not admissible since it arises as the 
<p>TM-image of 000 which is not an alement of£(G). 

Example 6. The system G = (■42,^,0), where <p(0) = 01,y>(l) = 11, is not circular. 
Indeed, for all n the word l n has no synchronizing point since it can be decomposed as 
1 1 1 1 1 1 1 ■ ■ ■ and 1 1 11 1 11 1 

This example is very simple since the respective infinite fixed point 011111 ■ ■ • is 
eventually periodic. However, there are also aperiodic non-circular systems. 
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Example 7. The system G = (Aa,<p,0), where ip(0) = 010,(^(1) = 22, ip(2) = 
11, is not circular. The argument is the same as in the previous example, since the 
words 1™ are for all n £ N elements of C{G). However, the infinite word ^"(O) = 
0102201011110102- ■ • is aperiodic. 

One can notice that the languages in the both non-circular examples contain an 
arbitrary power of 1. It is not just a coincidence but a general rule. 

Theorem 8 (Mignosi and Seebold [131 ) . If a DOL-system is k-power-free (i.e., C(G) 
does not contain the k-power of any word) for some k > 1, then it is circular. 

Thus, non-circular fixed points must have infinite critical exponent, in fact, they 
must contain an unbounded power of some word. 

Theorem 9 (Ehrenfeucht and Rozenberg [13]). Given a DOL-system G = (A,tp,w), 
if C(G) contains a k-power for all k £ N, then G is strongly repetitive, i.e., there 
exists a nonempty v £ L(G) such that v £ C(G) for all I £ N. 

We see that non-circular systems have very special properties. Furthermore, the 
morphism of a non-circular system cannot be even primitive. A morphism tp over A 
is primitive if there is k £ N such that ip k (a) contains b for all a, b £ A. 

Theorem 10 (Mosse [15 ). Any DOL-system G = {A, (p, a) with ip infective on Q and 
primitive is circula^ 

No matter how non-circular systems seem to be bizarre, there is no known finite 
algorithm which would decide whether a given general DOL-system is circular or not. 
Of course, if the respective morphism is primitive, it is easy to prove it in finite steps. 
Later on we also prove that if the system is non-pushy, then the circularity is equivalent 
to repetitiveness which is decidable. 

Example 11. An example of non-primitive but circular morphism is the one given by 
i — y 0010, 1 i — y 1 . This is the Chacon morphism \16T/ and 5 is its synchronizing delay. 

4.2 Non-pushy DOL-systems 

The following two definitions and the lemma are taken from |14j . 

Definition 12. Let G = (A,(p,w) be a DOL-system. A letter b £ A has rank zero if 
L{Gb), where Gb = (A,ip,b), is finite. 

Definition 13. A DOL-system G — (A,tp,w) is pushy if for all n £ N there exists 
v £ C(G) of length n which is composed of letters that have rank zero; otherwise G is 
non-pushy. If G is non-pushy, then q(G) denotes 

q(G) — ma,x{\v\ | v £ C(G) is composed of letters that have rank zero}. 

Lemma 14. 

1. It is decidable whether or not an arbitrary DOL-system is pushy. 

2. If G is pushy, then C(G) is strongly repetitive (see Theorem^. 

3. If G is non-pushy, then q(G) is effectively computable. 

2 In the article |15) the circular systems are called "recognizable". 
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4- It is decidable whether or not an arbitrary DOL-system is strongly repetitive. 

Corollary 15 (Krieger [ID]). Let G = {A,tp,a),a £ A, be a non-pushy DOL-system 
and let u = tp"(a) be an infinite fixed point of ip. Then there exists a non-erasing 
morphism ip' and an effectively computable C £ N such that u — (<p') u (a) and for all 
v £ £(G) with \tp'(v)\ — \v\ we have \v\ < C. 

This means that for any word v of length at least C we have |</s(u)| > M + 1- More 
generally, if \v\ > KC then \ f(v)\ > \v\ + K. We will use this in the proof of the main 
theorem of the following subsection. 

In the sequel, we always suppose that the G = [A, tp, a) is such that ip' can be 
taken equal to ip (in fact ip' is just a power of <p, see the proof in [IQ]). This is without 
loss of generality since the language is the same. 

4.3 Forky sets 

Our aim is to define properly the notion of /-image introduced in Section [3] As 
explained, we need to have two directed labeled graphs defined on unordered pairs of 
left and right prolongations. Since left and right extensions usually refer to letters and 
the vertices of our graphs might be pairs of words, we give the following definition. 

Definition 16. Let u be an infinite word and w its factor. The set of left prolonga- 
tions of w is the set 

Lpro(ii)) = {i)6 A + 3 vw S £(u)}. 
In an analogous way we define the set of right prolongations Rpro(io). 

The sets Lpro(io) and Rpro(io) are, in general, infinite. 

Our aim is to specify a suitable finite sets Bl and Br of (unordered) pairs of left 
and right prolongations such that it allows to define correctly an /-image of all triplets 
((lOi, 102), V, (103, 104)), where v is a BS factor, (101,102) a pair of its left and (11)3,104) 
a pair of its right prolongations from Bl and Br, respectively. The /-image defined 
by the sets Bl and Br are to be defined as a BS triplet ((w[, w' 2 ), v' , (w' 3 , w 4 )), where 
(w'i,w' 2 ) and (w' 3 , u> 4 ) are again in Bl and Br and the factor v' = fL(w[,w' 2 )p(v)fR(w' 3 ,w' i ) 
is BS. The mappings /l and /r are defined as follows. 

Definition 17. Let ip be a morphism over A and let (vi, V2) be an unordered pair of 
words from A + . We define 

ft{vi,V2) = the longest common suffix of ip(v±) and ip(v2), 
/r (wi , «2 ) = the longest common prefix of ip(v±) and 95 (t?2 ) . 

The purpose of the following definitions is just to describe "good" choices of Bl 
and Br. 

Definition 18. Let (101,102) and (vi,?^) be unordered pairs of words. We say that 
(i) (w\,W2) is a prefix ('suffix,) of (vi,V2) if either u)i is a prefix (suffix) of vi and 

W2 of V2, or wi is a prefix (suffix) of V2 and W2 of v\; 
(ii) (101,102) and (1)1,1)3) are L-aligned if 

{v\ — uwi or wi = uvi) and (V2 = u'w2 or W2 = 1/1)2) 

or 

(ui = UW2 or W2 = uvi) and (1)2 = tt'lDi or Wi = u V2) 
for some words it, it'. 
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Analogously, we define pairs which are R-aligned. 



Example 19. The pairs (01, 0) and (001, 10) are L-aligned, while (01, 0) and (Oil, 10) 
are not L-aligned. Schematically, the notion of L-aligned pairs of words is depicted in 
Figure [J| 



given two pairs of words: 



and 



L-aligned: 




not L-aligned: 



Figure 2: L-aligned and not L-aligned pairs of words. 

Definition 20. Let ip be a morphism with a fixed point u. A finite setBh of unordered 
pairs (wi,W2) of nonempty factors ofu is called L-forky if all the following conditions 
are satisfied: 

(i) the last letters of wi and W2 are different for all (u>i, W%) 6 Bl, 

(ti) no distinct pairs (wi, W2) and (w' 1 ,w' 2 ) from Bl are L-aligned, 

(Hi) for any vi,V2 £ C(u) \ {e} with distinct last letters there exists (101,102) £ Bl 
such that (101,102) and (v±,V2) are L-aligned, 

(iv) for any (101,102) £ Bl there exists (101,11)2) £ Bl such that 

(w' 1 f L (w 1 ,W 2 ),W2fL(w 1 ,W2)) 

is a suffix of (tp(wi),(p(w 3 )). 
Analogously we define an R-forky set. 

Since the definition may look a bit intricate, we now comment on all the conditions. 
Condition (i) says that wi and 102 are left prolongations of LS factors (note that all 
pairs of words 11)1,11)2 £ £(u) are prolongations of the empty word, i.e., elements of 
Lpro(e)). Condition (ii) is required to avoid redundancy in Bl- Condition (Hi) ensures 
that any two left prolongations of any LS factor are included in Bl in the following 
sense: if we prolong or shorten them in a certain way we obtain a pair from Bl- And, 
finally, Condition (iv) is there because of the definition of the /-image: we want to be 
able to apply it repetitively. Note that due to (ii) and (Hi) the pair (w[, w 2 ) from (iv) 
is uniquely given. Note also that if (i) is satisfied and the words from all the pairs of 
Bl are of the same length, then (ii) and (Hi) are satisfied automatically. 
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Example 21. Consider the morphism ips from Section^ deifned by i — >■ 0012, 1 i->- 
2,2 h> 012. This morphism is injective and primitive and so, by Theorem \1C\ the 
respective DOL-system is circular. One can easily prove that 3 is a synchronizing delay 
(note that all factors containing 2 has a synchronizing point • • • 2| • • • .) 
Since ips is prefix-free, we get that the set 

B K = {(0,1), (0,2), (1,2)} 

is R-forky. However, this set is not L-forky: Condition (iv) is not satisfied for any 
pair since ys(l) is a su ffi x °f VsC^) which is a suffix of <ps(0). To remedy this, we 
consider left prolongations one letter longer which are ending in 1 and 2. Since the 
list of all factors of length 2 reads 

00,01,12,20,22 

the new pairs are 

(0,01), (0,12), (0,22), (2, 01). (2) 

For these pairs conditions (i) - (Hi) are again satisfied. But (iv) is not satisfied for 
(0, 12) since f L (0, 12) = 012 and (¥>s(0)(012) _ \<ps(12)(012) -1 ) = (0, 2) has no suffix 
in list (|2|. Hence, we have to prolong 12 again. There is only one possibility, namely 
012. The resulting set 

Si = {(0,01), (0,012), (0,22), (2,01)} 

is then L-forky since now we get (p s (0)(012) _ \ </> s (012)(012) _1 ) = (0,00122) with a 
suffix (0,22) G B L . 

Theorem 22. Let if be a morphism on A with a fixed point u = ip"{a). If (A,<p,a) 
is circular non-pushy system, then it has L-forky and R-forky sets. 

Proof. Set M = DC, where D is a synchronizing delay and C is the constant from 
Corollary |15| Define 

Bl = {(wi,U)2) : u>i,W2 G £(u), \wi\ = \w 2 \ = M, 

and the last letters of wi and W2 are different}. (3) 



We claim that Bl is L-forky. Conditions (i) - (iii) from Definition 20 are trivially 
fulfilled. It remains to prove (iv). 

It is clear that we must have |/£(wi,ti)2)| < D for any (wi,w 2 ) since fL(wi,w 2 ) 
without the last letter does not have any synchronizing point. Now, it suffices to 
realize that for any w of length M we have > \w\ + D — M + D and so 

(</?(wi)(/l(wi, w 2 ))~ x , ip(w 2 )(fL(wi, w 2 )) _1 ) has a suffix in Bl- The proof of existence 
of an R-forky set is perfectly the same. □ 

This proof does not give us a general guideline how to construct L-forky and R- 
forky sets since, as we have recalled earlier, we do not have an effective algorithm 
computing a synchronizing delay for a general morphism. Moreover, the L-forky set 
constructed in the proof is usually too huge. As in the case of our example morphism 



(ps from Example 21 (for this morphism C = 2 and D = 3), there usually exists a 
smaller L-forky set. 

Remark 23. The techniques of the proof of Theorem \22\ are the same as those of the 
proof of Lemma 11 in IWI - This Lemma, however, is concerned with the notion of 
critical exponent. 



9 



Definition 24. Let <p be a morphism with a fixed point u and let Bl be an L-forky 
set. We define the directed labeled graph of left prolongations GL^f L ' as follows: 

(i) the set of vertices is Bl, 

(ii) there is an edge from (wi,W2) to (w3,Wi) if (w3/l(u>i, W2), W4/l(»i, W2)) is a 
suffix of (<p(wi), ip(w2)). The label of this edge is fh{w\,W2)- 

In the same manner we define the graph of right prolongations GRjf . 

As a straightforward consequence of the definition of forky sets (especially of Con- 
dition (iv)) we have the following property of the graphs. 

Lemma 25. Each vertex in a graph of left and right prolongations has its out-degree 
equal to one. 

Consequently, any long enough path in the graph ends in a cycle and any component 
contains exactly one cycle. 

Example 26. The graphs GL^ and GR^""' for ips and for the sets Bl and Br 
from Example \21\ are in Figure^ 



Figure 3: The graphs G~L {Bl) and GR (Bn) for the morphism ip s . 

Definition 27. Let ip be a morphism on A with a fixed point u and let Bl and Br 
be L-forky and R-forky sets, respectively. A triplet ((toi, W2), v, (u>3, 104)) is called a 
bispecial (BS) triplet in u if (wx, 11)2) € Bl, (103,101) € Br and W1VW3, W2VW4 6 £(u) 
or W1VW4, W2VW3 G £( u )- 

Lemma 28. Let <p be a morphism with a fixed point u, let Bl be an L-forky and Br 
an R-forky set and let T = ((tt>i, W2), v, (W3, W4)) be a bispecial triplet of u. Let us 
denote by 

(i) gx(iui, W2) the end of the edge of GL^, starting in (wi,W2), 
(ii) gR(ws,W4) the end of the edge o/GR^ Bii ' 1 starting in (W3, W4). 








012 
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Then 

7" = (g L (u)i,W 2 ), f L (w ± , W2)<p(v)f R (w3, W4.),gR(W3,W4.)) 

is also a bispecial triplet of u. 

Definition 29. Denote B = (Bl,Br). The bispecial triplet T' from the previous 
lemma is called the /g-image of a bispecial triplet T = ((wi,w 2 ),v, (u>3,u>4)). 

Example 30. Consider again the morphism <ps- Let B = (Bl,Br), where Bl and 
Br are those from Example \21\ Then ((0, 012), 0, (0, 1)) is a bispecial triplet since both 
001 and 01200 are factors. Its f B -image reads ((0, 22), 0120012, (0, 2)) for we have 
51,(0,012) = (0,22), MO, 012) = 012, g R (0, 1) = (0,2), and f R (0,l) = e. 



Condition (iv) from Definition 20 of forky sets allows us to get a compact formula 
for the (/g) n -image, i.e., /g-image applied repetetively n times. 

Lemma 31. Let (p be a morphism and let ((101,102),!), (11)3,11)4)) be a bispecial triplet 
for some forky sets B — (Bl,Br). Then for all n 6 N it holds that its (/b)™ -image 
equals 

(gl(wi,w 2 ), f L ('fi n ' 1 (u} 1 ), ifi n ~ 1 (w 2 ))^"{v)f R (ifi n ~ 1 (w3), ^ n ~ 1 (w 4 )), gR(w 3 , w 4 )). 



4.4 Initial BS factors 

From the previous subsection, we know how to get a sequence of BS factors from some 
starting one: we just apply /e-image repetitively. The goal of the present subsection is 
to prove that for circular systems there exists a finite set of initial BS factors (triplets) 
such that any other BS factor is an (/g) n -image of one of them. 

Definition 32. Let p be a morphism infective on £(u), where u is its fixed point, let 
Bl and Br be L- and R-forky sets, and T = ((w\,w 2 ),v, (1^3,^4)) a bispecial triplet. 
Assume, without loss of generality, that Wivw3,w 2 vw4 S £(u). An ordered pair of 
factors (vi,v 2 ) is called a BS-synchronizing point of T if v = viv 2 and 

Vlti, u 2 ,U3, Ui G A* , (U1W1VW3U3, G tp(£(u)) 

UlW 2 Vl, U 2 W 2 Vl, W2W3M3, V 2 W4U4 £ tp(£(u)). 

We denote this by v — vi\i, s v 2 . 

The notion of BS-synchronizing point is weaker than the one of synchronizing 
point: it holds that if v = v-l\ s v 2 , then v = Vi\b s v 2 - The other direction is not true as 
it follows from this example: 

Example 33. Given a morphism >-> 010, 1 210, 2 220, the factor has no syn- 
chronizing point. On the other hand, if we take it as the bispecial triplet ((1, 2), 0, (0, 2)) 
it has the BS-synchronizing point 0|i, s . 

Definition 34. Let ip be a morphism with a fixed point u which is injective on £(u). 
A bispecial triplet T = ((wi, W2), v, (w>3, W4)) is said to be initial if it does not have 
any BS-synchronizing point. 

Definition 35. Let T = ((wi, UI2), v, (W3, W4,)) be a bispecial triplet which is not ini- 
tial and let (vi,v 2 ), (113,114), . . . , (v 2 m-i,v 2m ) be all its BS-synchronizing points such 
that \vi\ < \v3\ < ■■■ < \v 2m -i\. The factor v\ is said to be the non-synchronized 
prefix, v 2m the non-synchronized suffix and the factor (vi)~ 1 v(v 2m )~ 1 is called the 
synchronized factor ofT. 
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Now we state the main theorem of this section. 

Theorem 36. Let {A, ip, a) be a circular non-pushy DOL-system, Bl and Br its L- 
forky and R-forky set, and u = tp u (a) infinite. Then there exists a finite set X of 
bispecial triplets such that for any bispecial factor v there exist a bispecial triplet T G X 
and n G N such that ((u>i, W2), v, (W3, 11)4,)) = (/b)"(T) for some (wi,w 2 ) G Bl and 
(w3,Wi) G Br. 

Proof. Let X be the set of initial bispecial triplets. The fmiteness of I is a direct 
consequence of the definition of circularity: elements of X cannot be longer than the 
synchronizing delay. The rest of the statement follows from the fact that any non- 
initial triplet has at least one /g-preimage. 

To prove this, it suffices to realize that the synchronized factor of W2), v, {wz,Wi)) 
has unique (p-preimage v' (possibly the empty word) and that the non-synchronized 
prefix (resp. suffix) must be equal to fL,(wi,w' 2 ) for some (w[,W2) G Bl (resp. to 
fR(w' 3 ,w' 4 ) for some (1113,104) G Br). 

□ 

Example 37. We now find the set X for us with B = (Bl,Br), where Bl and 
Br are from Example \21\ the graphs of prolongations are in Figure [3j The only BS 
factors without synchronizing points are e and since any other factor contains the 
letter 2 (and hence it has a synchronizing point) or is not BS. It remains to find all 
corresponding triplets: 

((0,01),e,(l,2)), ((0,01), c,(0, 2)), ((0, 012), e, (0, 1)), ((0, 012), e, (0,2)), 
((0,012), e,(l,2)), ((0,22), e,(0,l)),. ((2, 01), 6, (0,2)), ((0, 012), 0, (0, 1)). 

Since we are usually interested in nonempty BS factors, we can replace the bispecial 
triplets containing e with their f&-images and get: 

((2, 01), 2, (0,2)), ((2, 01), 20, (0,1)), ((0, 22), 012, (0, 2)), 
((0,22), 0120, (0,1)), ((0,012), 012, (0,2)), ((0, 012), 1, (0, 1)). 

There are only 6 bispecial triplets since ((0, 012), e, (0, 1)) and ((0, 012), e, (1, 2)) have 
the same f^-image and so do ((0, 01), e, (0, 2)) and ((2, 01), e, (0, 2)). 

5 Infinite special branches 

In the preceding section we have described a tool allowing us to find all BS factors. 
It requires some effort to construct the graphs GL and GR and to find all initial 
bispecial triplets, but it can be done by an algorithm. However, even if we have all 
these necessities in hand, it may be still a long way to the complete knowledge of the 
structure of all BS factors. Nevertheless, there is a class of special factors which can 
be identified directly from the graphs GL and GL, namely, the prefixes (or suffixes) of 
the so-called infinite LS (or RS) branches. 

Definition 38. An infinite word w is an infinite LS branch of an infinite word u if 
each prefix of 'w is a LS factor of u. We put 

Lext(w) = O Lext(-u). 

v prefix of w 

Infinite RS branches are defined in the same manner, only that they are infinite to the 
right. 
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Here are some (almost) obvious statements on infinite special branches in an infinite 
word: 

Proposition 39. 

(i) If u is eventually periodic, then there is no infinite LS branch of u, 

(ii) if u is aperiodic, then there exists at least one infinite LS branch of u, 

(Hi) if u is a fixed point of a primitive morphism, then the number of infinite LS 
branches is bounded. 

Proof. Item (i) is obvious, (Hi) is a direct consequence of the fact that the first dif- 
ference of complexity is bounded [17] . The proof of item (ii) is due to the famous 
Konig's infinity lemma [T5] applied on sets Vi, V2, ■ ■ ., where the set 14 comprises all 
LS factors of length k and where Vi G Vi is connected by an edge with V2 G Vi+i if V\ 
is prefix of 112 • □ 

Imagine now that we have an L-forky set Bl and an infinite LS branch w. There 
must exist (vi, V2) £ Bl such that viw and V2W are factors for any prefix w of w. Such 
a pair is called an infinite LS pair. 

Definition 40. Let (v\,V2) be an element of an L-forky set corresponding to a fixed 
point u of a morphism tp. The ordered pair ((wi, V2), w) is called an infinite LS pair if 
for any prefix w of w the words V]_w and V2W are factors of u. 

Further, we define the /g^-image of an infinite LS pair ((vi, V2), w) as the infinite 
LS pair «2),w'), where (v[,v'2) = gh(vi, v%) and w' = /t(«i, «2)v 5 ( w )- 

Having the fs L -image of an infinite LS branch, we are again interested in its ft3 L - 
preimage. 

Lemma 41. Let (A, ip, a), a £ A, be a circular DOL-system with an infinite fixed point 
u = tp w (a) and let Bl be its L-forky set. Then any infinite LS pair is the fs L -image 
of a unique infinite LS pair. 

Proof. Let ((fi, V2), w) be an infinite LS pair and let D be a synchronizing delay of ip. 
Then any prefix of w of length at least D has the same left-most synchronizing point 
(wi, (wi)~ 1 w). Since such w is LS, then u>i must be a label of an edge in GL^ L ' 1 
whose end- vertex is (vi,V2) and starting one in (v[,v<2). The infinite word (u;i) _1 w 
must have a unique y-preimage w'. □ 

Since any infinite LS pair ((v\, V2), w) has an /e^-preimage, the in-degree of the 
vertex (vi, V2) in the graph of left prolongations GL^f 1 *' must be at least one. 

Corollary 42. Let (A,<p,a) be a circular DOL-system with a fixed point u and an 
L-forky set Bl- If ((vi, V2), w) is an infinite LS pair then (vi,V2) is a vertex of a cycle 
m GL^ L) . 

We know that the number of infinite LS pairs in a fixed point of a primitive 
morphism is finite (see Proposition |39[ ); the following proposition says that this is 
true even if we weaken the assumption from primitive to circular and non-pushy. The 
proof of the proposition will, moreover, give us a simple method of how to find all 
these infinite LS pairs. 
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Theorem 43. Let {A,ip,a),a £ A, be a circular DOL-system with an L-forky set Bl 
such that <p u (a) = u is infinite. Then there exist only a finite number of infinite LS 
pairs. 

Proof. Let ((-ui, D2), w) be an infinite LS pair and let (vi,V2) be a vertex of a cycle 
m GL^f Let us assume that the cycle is of length k. Then it is labeled by words 
/l(vi,«2), /l(pl(«i, v 2 )),••• , v 2 )) where ^2) is the label of the edge 

starting in (vi,V2) (see Figure Hi. We distinguish two cases: 



h(gL(vi,v 2 )) 




Figure 4: The notation from the proof of Theorem [43] 
(a) At least one of the labels of the cycle is not the empty word. Applying k times 



Lemma 



41 



we can find the infinite LS pair ((i>i, 1)2), w') such that w is the {fe L ) fc -image 
of ((vi,V2),w r ), i.e., 

W = fL{g k L ~ 1 (v 1 ,V 2 )) ■ ■ ■ (p k ~ 2 (fL(gL(vi,V2))(p k ~ 1 (fL(vi,V 2 )) lfi k (w') = Sip k (w'). 

v ' 

denoted by s 

Since w' can be expressed again as w' = Sip k (w") for some infinite LS pair ((«i, ^2), w"), 
we have 

k / \ 2k / \ 3k 1 ll\ 
W — Sip (S)ip (S)ip (W ). 

Continuing in this construction one can prove that Sifi k (s) ■ ■ ■ ip nk (s) is a prefix of w 
for all ngN. Therefore, we get 

k { \ 2k { \ 3k { \ 
W = Sif (S)ip {S)ip (S) ■ ■ ■ . 

We have just shown that exactly one infinite LS pair corresponds to each vertex 
of the cycle. 

(b) Now assume that all the labels of the cycle are empty words. In such a case 
the fs L -image coincides with ip-image, meaning that (fe L Y -image of ((ui, 1)2), w) is 
{g J L (vi, V2), (y3 J (w)) for all j — 1, 2, ■ • ■ . We want to prove that w must be a periodic 
point of ip. Consider the directed graph whose vertices are the first letters of ip(b), b £ 
A, and there is an edge from b to c if c is the first letter of <p(b). Clearly, the first letter 
of w, say b, must be again a vertex of a cycle in this graph. Let £ be the length of this 
cycle. For reasons analogous to those above the (/s £ ) J -image and (fs L ) J *-preimage 
of w must also begin in b. Therefore, w contains the factor tp 1 (b) as a prefix for all 
j = 1, 2, . . . and this implies that w = (ip i.e., w is a periodic point of ip. 
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Since the number of vertices of GL^, and of periodic points is finite, the number 
of infinite LS pairs must be finite as well. □ 

The previous proof is also a proof of the following corollary which gives us a method 
of how to find all infinite LS branches. 

Corollary 44. Let (A, (p, a), a £ A, be a circular DOL-system, u = <^ w (a) infinite 
with Bl an L-forky set and let U2), w) be an infinite LS pair. Then either w is a 
periodic point of ip, i.e., 

w = <p e (w) for some I > 1, (4) 

and (vi, 1)2) is a vertex of a cycle in GL^f L ' labeled by e only, or w — sip e (s)ip 2e (s) ■ ■ ■ 
is the unique solution of the equation 

w = s(p (w), (5) 

where (1)1,1)2) is a vertex of a cycle in GL V containing at least one edge with a non- 
empty label, I is the length of this cycle and 

s = /i (SlT 1 («1,«2)) •••</? -2 (/i(ff£(vi,W2))^ -1 (/i(ui,U2)). (6) 

We demonstrate this method on an example morphism. 

Example 45. We consider the morphism 

tpp : 1 H- 1211, 2 i-> 311, 3 h-> 2412, 4 i-> 435, 5 h-> 534 (7) 

uiii/i u = <y3p(l). TTiis morphism is suffix- and prefix-free and so the set of all unordered 
pairs of distinct letters is L-forky. The graph of left prolongations is in Figure^ The 
morphism tpp has five periodic points 

^(i),^(4),^(5),(^r(2),(^r(3). 

It is easy to show that 

Lext(l) = {l,2,3,4,5},Lext(2) = {1, 4, 5}, Lext(3) = {1,4,5}, 

Lext(4) = {l,2,3},Lext(5) = {1,2,3}. 
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Looking at the graph of left prolongations depicted in Figure^ we see that </?p(4) and 
y?p(5) are not infinite LS branches as none of the vertices (1,2), (2,3) and (1,3) is a 
vertex of a cycle labeled by e only. Hence, only y>p(l), (^3p) w (2), (y>p) w (3) are infinite 
LS branches with left extensions 1,4,5. 

As for infinite LS branches corresponding to Equation (JHJl, in the case of our ex- 
ample, there is only one cycle which is not labeled by the empty word: the cycle between 
vertices (1,2) and (2,3). There are two (= the length of the cycle) equations corre- 
sponding to this cycle 

w = (pp(ll)ifip(w) and w = llip P (w). 

They give us two infinite LS branches 

^(11)^(11)^(11) ••■ , 
11^1(11)^(11) ••■ , 

the former having left extensions 1 and 2 and the latter 2 and 3. 

6 Assumptions and a connection with the criti- 
cal exponent 

The method of how to generate all BS factors of a given DOL-system we have described 
above bears on two facts: There exists L- and R-forky sets and the number of initial 
BS triplets is finite. The former was proved for circular and non-pushy systems and the 
later for circular ones only. Are these assumptions necessary or can they be weakened? 

Let us take a morphism i — > 001, 1 i — >■ 1 which is pushy and circular and its fixed 
point u starting in 0. If we try to construct an L-forky set as it has been defined in 
this paper, we will find out that it is not possible. A natural cantidate for veritices of 
the graph of left prolongations are pairs (0, 1") with n £ N, but any finite set of such 
pairs does not satisfy the property (iv) of Definition |20| So it seems that to assume 
the morphism being non-pushy is inevitable for existence of forky sets. However, if 
we relax the definition and enable the pairs of factors to be infintely long, we can 
find something like L-forky set even for this morphism: Define a directed graph of left 
prolongations such that it has only one vertex (0, • • • 111) and one loop on this vertex 
with label 1 and a directed graph of right prolongations with one vertex (0, 1) and a 
loop on it with empty label, then all BS factors in u are the /-images of BS-triplet 
((0, • ■ ■ 111), 0, (0, 1)), namely 0, 1<^(0), 1<^(1)^ 2 (0), ... 

Now, consider a morphism h-> 001, 1 h-> 11 and its fixed point u starting in 0. This 
morphism is non-pushy and non-circular. In this case, L- and R-forky sets exists, we 
can simply take {(0, 1)} and there is only one (nonempty) initial BS-triplet with no BS- 
synchronizmg point ((0, 1), 0, (0, 1)). Its /-images again reed 0, l<p(0), ly>(l)yj 2 (0), . . .. 
In fact, to prove that the set of initial BS-triplets is finite, we need to know only that 
there is not infinite numer of BS-triplets without BS-synchronizing point and it seems 
to be true even for non-circular morphisms. 

All considered morphisms for which our method does not work (or is not proved 
to work) have an infinite critical exponent. The following theorem says this is not a 
misleading observation but a general rule. 

Theorem 46. Let G = (A, <p, w) be a DOL-system. Then the critical exponent of 
C{G) is finite if and only if G is circular and non-pushy. 
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Proof. (=>): Circularity follows from Theorem [8] G being pushy is in contradiction 
with Lemma (14] thus, it is non-pushy. 

(<=): Suppose the critical exponent of £(G) is infinite and that G is circular and non- 
pushy. According to Theorem [9] there exists a non-empty factor v £ £(G) such that 
for all n £ N, u™ 6 £{G). Take the shortest factor v having such property. Since G is 
circular, there exists a finite synchronizing delay D. Take N £ N such that \v \ > D. 
Then v N contains a synchronizing point, i.e., v N = vi\ B V2- It is clear that v 
contains at least two synchronizing points, i.e., v N+1 — vi\ s V2V = Wi\ s V2. In general, 
v N+k contains k + 1 synchronizing points at fixed distances equal to \v\. Since tp is 
injective, it implies that there exists a unique 2 £ C{G) such that v N+k = pip(z k )s 
(for some factors p and s) and z k £ C(G) for all k > 0. According to the choice of v, 
it is clear that |</j(jz)| = \ z \ = M- Denote by £1(2) the set of letters occurring in z. It 
is clear that <p(£.i(z)) = Ci(v) and Va £ £1(2) we have |v5( a )| = 1- 

We can now repeat the process: take the factor 2 to play the role of factor v. Thus, 
we can find an infinite sequence of factors zq — z, zi, 22 . . . such that 93(^1(^+1)) = 
Ci(zk) and \zk\ = \z\ for all k > 0. Since A is finite, it is clear that there exists integers 
m/< such that C\(z m ) — jCi(ze). This implies that for all k the factor z^ is composed 
of letters of rank zero. This is a contradiction with G being non-pushy. □ 

7 Conclusion 

The tool we have introduced in this paper enables to construct an algorithm which 
can find all BS factors in a given circular non-pushy DOL-system so that it produces 
the graphs of prolongations and the set of initial BS factors - its slightly simplified 
version was implemented by Stepan Starosta using SAGE. The scatch of the algorithm 
is as follows: 

1. Decide whether the input DOL-system is strongly repetitive using the algorithm 
from [Tl]. If it is, then by the previous theorem and Theorem[9]the DOL-system 
is non-circular or pushy and our method does not work. If it is not, proceed 
with to steps. 

2. Construct L- and R-forky sets: the details of the construction are a bit technical 
but the basic idea is the same we used in Example |21| 

3. Find all initial BS triplets without any BS-synchronizing point. The fact that 
the system is circular ensures the algorithm stops after a finite number of steps. 
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